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ABSTRACT 



Context. Several studies have proposed that the preferred model of the comoving spatial 3-hypersurface of the Universe 

may be a Poincare dodecahedral space (PDS) rather than a simply connected, infinite, flat space. 

Aims. Here, we aim to improve the surface of last scattering (SLS) optimal cross-correlation method and apply this to 

observational data and simulations. 

Methods. For a given "generalised" PDS orientation, we analytically derive the formulae required to exclude points 

on the sky that cannot be members of close SLS-SLS cross-pairs. These enable more efficient pair selection without 

sacrificing the uniformity of the underlying selection process. For a sufficiently small matched circle size a and a fixed 

number of randomly placed points selected for a cross- correlation estimate, the calculation time is decreased and the 

number of pairs per separation bin is increased. Using this faster method, and including the smallest separation bin 

when testing correlations, (i) we recalculate Monte Carlo Markov Chains (MCMC) on the five-year Wilkinson Microwave 

Anisotropy Probe (WMAP) data; and (ii) we seek PDS solutions in a small number of Gaussian random fluctuation 

(GRF) simulations in order to further explore the statistical significance of the PDS hypothesis. 

Results. For 5° < a < 60°, a calculation speed-up of 3-10 is obtained, (i) The best estimates of the PDS parameters 

for the five-year WMAP data are similar to those for the three-year data, (ii) Comparison of the optimal solutions 

found by the MCMC chains in the observational map to those found in the simulated maps yields a slightly stronger 

rejection of the simply connected model using a rather than the twist angle <j!>. The best estimate of a implies that, 

given a large-scale auto-correlation as weak as that observed, the PDS-like cross-correlation signal in the WMAP data 

is expected with a probability of less than about 10%. The expected distribution of <jf) from the GRF simulations is not 

uniform on [— tt, n] . 

Conclusions. Using this faster algorithm, we find that the previous PDS parameter estimates are stable to the update 

to five-year WMAP data. Moreover, for an infinite, fiat, cosmic concordance model with Gaussian random fluctuations, 

the chance of finding both (a) a large-scale auto-correlation as weak as observed and (b) a PDS-like signal similar to 

what is observed is less than about 0.015% to 1.25%. 

Key words, cosmology: observations - cosmic microwave background - cosmological parameters 
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1. Introduction 

During the past half-decade, attention has focussed on the 
Poincare Dodecahedral Space (PDS) as a potentially bet- 
ter model of comoving space than the infinite flat model 
(Luminet et al. 2003; Roukema et al. 2004; Aurich et al. 
2005a, 2005b; Gundermann 2005; Key et al. 2007; Niarchou 
& JaflFe 2007; CaiUerie et al. 2007; Lew & Roukema 2008; 
Roukema et al. 2008). If the hypothesis that the comoving 
spatial section of the Universe is a PDS is correct, then it 
should be possible to estimate the astronomical coordinates 
of the fundamental domain, as has been tried by Roukema 
et al. (2008) (hereafter, RBSG08). Moreover, successive im- 
provements in data and analysis methods should yield suc- 
cessively more precise and more accurate estimates of these 
astronomical coordinates. 

Here, we aim to improve on the method of optimising 
the cross-correlation ^c [Eq. (1), RBSG08] of cosmic mi- 
crowave background (CMB) temperature fluctuations be- 
tween copies of the surface of last scattering (SLS) pre- 
sented in Roukema et al. (2008) and also applied by Aurich 
(2008). We investigate whether or not this can lead to im- 



proved parameter estimates. We also test a small number 
of simulations to see whether the infinite flat model can 
reproduce the observational signal. 

In RBSG08, the method was introduced by pointing 
out that algebraically, it is an extension of using the iden- 
tifled circles principle first published by Cornish et al. 
(1996, 1998). Here, we make use of the fact that the cross- 
correlation method is not only an algebraic extension of 
the identified circles principle, but its relation to the iden- 
tified circles principle can also be interpreted in terms of 
identified annuli. This leads to algebraic/trigonometric re- 
lations that enable a faster calculation of cross-correlations, 
without sacrificing the uniformity of the underlying ran- 
dom selection of points on the sky. Moreover, use of these 
relations also increases the numbers of pairs per separa- 
tion bin, leading to less relative Poisson noise in individual 
cross-correlation estimates. The annulus interpretation and 
the associated relations are presented in Sect. 13.11 

In Sect. 13.21 use of this faster method to recalculate 
Monte Carlo Markov Chains (MCMC) for the five-year re- 
lease of the WMAP data is described. Since the new method 
increases the numbers of pairs per separation bin, the small- 
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est separation bin available when estimating correlations 
should not be as noisy as without this new method, so we 
include this smallest separation bin. 

Use of this faster method also makes it practical to per- 
form a consistent analysis of both the WMAP map and 
a small number of Gaussian random fluctuation (GRF) 
WMAP simulated maps, in order to estimate the expec- 
tation of a PDS-like signal under the assumption of a sim- 
ply connected model. In order to model this, we need to 
consider the following. 

One of the key motivations from WMAP data for study- 
ing multiply connected models has been the lack of struc- 
ture on angular scales larger than about 60°, as has been 
noted by several authors (e.g. Spergel et al. 2003; Luminet 
et al. 2003; Aurich et al. 2005b). As was noted in RBSG08, 
in particular in Eq. (10), if the PDS model is incorrect, 
then cross-correlations of temperature fluctuations at pairs 
of points that are implied (by the PDS hypothesis) to be 
spatially close should, on average, be much smaller than 
the auto-correlations at the same small spatial separation 
scale. This is because if the PDS model is wrong, then these 
pairs are in reality widely separated rather than closely sep- 
arated, so their correlations should, on average, be small. 

However, this is only a statistical statement: even for 
the case that the PDS model is physically wrong, it is pos- 
sible that some orientations of a PDS model, by chance, 
happen to give cross-correlations that violate Eq. (10) of 
RBSG08. How probable is it for chance large-scale correla- 
tions in particular directions to mimic a PDS-like signal? 
This clearly depends on the amplitude of the large length- 
scale auto-correlation. If this amplitude is low or high, then 
the chance of finding PDS-like signals using the optimal 
cross-correlation method should be low or high respectively. 
Since the range of matched circle sizes a studied in RBSG08 
included circles with a < 60°, most of the cross-correlations 
are from pairs of points separated observationally by angles 
>60° - Irad. 

The latter is the minimum scale above which Spergel 
et al. (2003) quantified the lack of temperature-temperature 
auto-correlations at large scales in the WMAP data, with 
their parameter S, defined 



5* = 



1/2 



[C{9)]^dcose, 



where C{9) is the two-point auto-correlation function at 
an angular separation of 9 on the SLS [Sect. 7, Eq. (9), 
Spergel et al. 2003]. Hereafter, we call this S^, in order to 
distinguish it from the S parameter used for matched circle 
analyses. The two parameters have very different meanings. 

The chance of observing S^ to be as small as that ob- 
served was estimated by Spergel et al. (2003) as 0.15% for 
an infinite, flat, cosmic concordance model with a fixed 
spectral index of density perturbations. Efstathiou (2004), 
using what should be a more accurate method if we as- 
sume a simply connected model, estimated the chances to 
be much higher, from 3% to 12.5% depending on which sky 
map is analysed and which (if any) galactic cut mask is 
used (Table 5, Efstathiou 2004). 

We can quantify the relation between large length-scale 
auto-correlations and the chance of a PDS signal in the 
case that the PDS is wrong by writing that the lower the 
observed value of S^, the less likely it is that a non-PDS 
model will give a PDS-like signal. Hence, since we expect 



that the two properties, a low S^ value and a low chance 
of a PDS-like cross-correlation signal occurring in a flat, 
infinite, cosmic concordance model, are related, it would 
not be useful to estimate their probabilities independently. 
Instead, what is of interest to investigate is the chance that 
a PDS-like cross-correlation signal occurs, given that S^ is 
as low as that which is observed. Conservatively, we can 
calculate an upper limit to this conditional probability if 
we use simulated sky maps with 5*^ values a little higher 
than the observed estimate of S^. Moreover, here we use 
the kp2 galactic cut mask, for which Efstathiou (2004) finds 
the highest probability for the observed S^ value to occur 
in an infinite, flat, cosmic concordance model, so we should 
obtain a conservative upper limit to the probability of both 
(a) an S^ value as low as that observed and (b) a PDS-like 
cross-correlation signal similar to that observed occurring. 
This is described in more detail in Sect. 13.31 

Results are presented in Sect. IH discussed in Sect. [5] 
and conclusions are given in Sect. [6l For general back- 
ground on spherical, multiply connected spaces, see Weeks 
(2001), Gausmann et al. (2001), Lehoucq et al. (2002) and 
Riazuelo et al. (2004). See the references cited above for 
background on cosmic topology. Comoving coordinates are 
used when discussing distances (i.e. "proper distances" at 
the present epoch, Weinberg 1972, equivalent to "confor- 
nial time" if c = 1) and the Hubble constant is written 
Ha = 100ft. km s-i/Mpc. 

2. Observations and simulations 

2.1. Observations 



The analysis described in Sect. 13. 21 uses the Internal Linear 
Combination (ILCj3 all-sky map of the five-year WMAP 
data (Hinshaw et al. 2008) and the foreground cleaned, 
Wiener filtered version of the same five-year data published 
by the Tegmark et al. (2003) group (TOH)0 The "kp2" 
mask to eliminate the Galactic Plane and other likely con- 
taminating regions, covering about 15% of the sky, is used 
throughout this paper unless otherwise notedO We do not 
smooth these maps. 



(1) 2.2. Simulations and associated observational map 



The analysis described in Sect. 13.31 uses a version of the 
WMAP map and sky simulations prepared consistently 
from three-year observational and simulated maps in the 
Q, V and W frequency bands, weighted by inverse noise as 
given in Hinshaw et al. (2003), and smoothed by a Gaussian 
of FWHM 1°, as described in Sect. 2 of Lew & Roukema 
(2008)13 This version of the observational map is referred 
to hereafter as "INC3" E 



^ http : //lambda . gsf c . nasa . gov/data/map/dr3/df p/ 1 
winap_ilc_5yr_v3 . f its 

•^ http : //spa ce .mit . edu/home/tegmark/wmap/| 
wiener5yr_ma p . f its| 

•^ Data file: http : //lambda . gsf c . nasa . gov/data/niap/dr2/ 1 
iancillary/wmap_kp2_r9_inask_3yr_v2 . f its ; 

map projection: http : //lambda . gsf c . nas a . gov/product/map/ 1 
current /map_images/f02_int_inask_b . png 

■* The maps can be downloaded from 

http://cosmo . torun . pl /GPLdownload/HCHC/ _ 
sims-from-LR08-project7] 



Inverse Noise Coadded 3-year 
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As mentioned above, the expected value of S^ from the 
infinite, flat, cosmic concordance model is higher than that 
observed. However, there is a (small) chance in the model 
that such a low S^ value occurs in any single realisation of 
the model, such as the one in which we live. Since we wish 
to test the chance that a PDS-like cross-correlation signal 
occurs given that S^ is as low as that which is observed, 
we generate simulations using the observational estimates 
of the spherical harmonic amplitudes Ci of the tempera- 
ture fluctuations as estimated in Hinshaw et al. (2007), 
rather than using the mean values implied by an infinite 
flat model. The phases of the spherical harmonics are ran- 
domised. Gaussian noise is simulated according to the prop- 
erties and scanning strategy of each differencing assembly 
and added to each simulated map. 

Since a lot of the large-scale power in what arc consid- 
ered as the best estimates of the cosmological signal in the 
WMAP data lie close to the Galactic Plane (e.g. Tegmark 
et al. 2003), simulations based on spherical harmonics with 
the same Ci 's as this signal but different phases will most of 
the time have high large-scale power which does not happen 
to lie inside the kp2 cut. Hence, simulations made with the 
Hinshaw et al. (2007) Ci estimates and masked with the 
kp2 mask will, in general, have S^ estimates larger than 
that which is actually observed for the part of the signal 
outside of the kp2 cut rather than approximately equal to 
it. 

This is not due to an error in the Hinshaw et al. (2007) 
estimates (assuming a simply connected modeO)- Even 
though these estimates for I < 30 are made using nearly 
the same kp2 cut that we use here (Hinshaw et al. 2007 
use a kp2 cut degraded in resolution), the spherical har- 
monics are nevertheless functions of the full sky. Hinshaw 
et al. (2007) use a maximum likelihood method which uses 
the information from the cut sky to infer the best estimate 
of the Ci values for these functions covering the full sky. 
Unsurprisingly, it implicitly extrapolates from the cut sky 
to the full sky, and recovers some of the information masked 
in the Galactic Plane. 

Since we wish to test the chance that a PDS-like cross- 
correlation signal occurs given that S*^ is as low as that 
which is observed, we have two obvious approaches to 
choose from in order to have simulations which are sta- 
tistically comparable with the observations. One approach 
would be to analyse the full sky for both observations and 
simulations. In this case, the S^ estimates should be approx- 
imately equal. However, a large part of the signal would be 
that from inside the Galactic Plane. The risk of implicitly 
testing the properties of foreground contaminants rather 
than of the cosmological signal would be high. 

The other approach is to analyse the cut sky (with the 
kp2 mask) for both observations and simulations. This de- 
creases the risk that our analysis will be affected by con- 
tamination from the Galactic Plane. However, for the cut 
sky, many of the simulations will have S^ higher than S^ of 
the observations. In this case, it will be necessary to select 
those simulations with the lowest values of 5*^ not greater 
than S^ of the observational map. 



For a multiply connected model, the distributions of the var- 
ious aim's are to some degree dependent on one other, so a 
method which assumes they are independent can at best give 
an approximate result. 




SLS 



SLS.' 



^sSLS 



Fig. 1. Relation of a spatial geodesic of length r2 joining a 
"close" pair of points in space to two copies of the SLS, and 
the angle a+ separating the member of the pair on the right- 
hand SLS from the dodecahedral face centre, i.e. from the spatial 
geodesic joining the two SLS copies. This figure shows a spatial 
geodesic "external" to the matched circles (intersection between 
the two copies of the SLS). The centres of the two copies of the 
SLS (2-spheres) are separated by {■k/5)Rc- See Sect. 13. Il 



This will not give an exact statistical match between 
simulations and observations. However, since greater S^ 
should increase the chance of cross-correlations occurring, 
this should give an upper limit for estimating the proba- 
bility of cross-correlations occurring in a simply connected 
model, i.e. a conservative estimate. This is the approach we 
adopt here. 

We estimate S^ for each of the observational and simu- 
lational using 



Sf 



2»-SLS „ 

[U{r)f d , 

rsLS ''SLS 



(2) 



where rsLS is the radius of the SLS, ^a('') is the (comov- 
ing) spatial auto-correlation function as given in Eq. (4) 
of RBSG08, and r is the comoving spatial separation of a 
pair of points on the SLS. This integral differs from that 
in Eq. (9) of Spergel et al. (2003), since here we focus on 
spatial separation, while Spergel et al. (2003) use an or- 
thogonal projection of the angle between a pair of sky posi- 
tions. The minimum angular separation used by the latter 
is 60° ~ 1 rad, so here we use rsLS for the minimum separa- 
tion. Provided that we estimate S^ in the same way for both 
observational data and simulations, the difference between 
spatial and projected angular definitions of S^ should not 
affect the questions of interest here. 

3. Method 

The method of using Markov Chain Monte Carlo simula- 
tions to optimise the cross-correlation ^c of temperature 
fluctuations between copies of the surface of last scattering 
(SLS) in the covering space S^, modelled as S^ C M**, is 
described in Sect. 3 of RBSG08. 

Since it is important that the choice of pairs of points 
is as uniform as the nature of the observations allows, the 



Roukema, Bulinski & Gaudin: Poincare dodecahedral space parameters 




Fig. 2. As for Fig. [T] showing a spatial geodesic, of length r2, 
"internal" to the matched circles, and angle a_ separating the 
member of the pair on the right-hand SLS from the dodecahedral 
face centre. 



selection of points on a copy of the SLS in RBSG08 was cho- 
sen uniformly on the SLS, i.e. on the 2-sphere. The binning 
into bins of pair separations was calculated only after the 
pair separations have been calculated by applying a holon- 
omy transformation to one of the members of the pair. 

From a computational point of view, applying the holon- 
omy is the most intensive step in the calculation, requiring 
the multiplication of a 4 x 4 matrix by a 4- vector, as a rota- 
tion in R^. This matrix calculation is carried out 12 times 
for a given pair of points, since all of the 12 holonomy trans- 
formations mapping a point to one of its neigbouring copies 
of the SLS must be examined. 

The set of 20 holonomy transformations that maps the 
fundamental domain to the next layer in the direction of 
the hyperspherical equator could, in principle, be used too. 
However, for these to give matched circles as small as 5°, 
this would require the total matter-energy density to be 
^tot ^ 1.03, which is uncomfortably high given present ob- 
servational estimates of iltot- To see this, replace tt/IO in 
Eq. (15) of RBSG08 by 7r/6, which is half the geodesic 
length of any member of this set of holonomy transforma- 
tions (Clifford translations), and use Rq — (c/iJo)(f^tot — 
1)""'^/^, where Re is the radius of curvature, i.e. the radius 
of S^ modelled as a subset of M"*. Moreover, it is not clear 
how a "generalised" twist parameter could be used if both 
sets (or yet further sets) of holonomy transformations were 
to be used in a single estimate of the cross-correlation func- 
tion. The twist for this set of holonomy transformations is 
±7r/3, not ±7r/5. For these reasons, we consider just the 12 
immediately neighbouring copies of the fundamental do- 
main, as in RBSG08. 

In this paper, we argue that after uniformly selecting 
points on the 2-sphere, a filtering of these points is possible 
in a way that excludes only those points that are certainly 
not members of any close SLS-SLS pairs, for a given maxi- 
mum pair separation r2 ■ This should enable shortening the 
time to find a given number of close pairs for a constant 



number of points Np, as well as finding a larger number of 
close pairs for the same value of Np^ since points that are 
certainly not members of any close pair are rejected before 
the iteration over pairs of points is started. The latter im- 
provement to the algorithm should enable practical use of 
the method for shorter pair separations. 

3.1. Preselection of potential members of close pairs: ai,a2 

Figures [T] and [2] show how a close pair used in a cross- 
correlation estimate relates to the two copies of the SLS 
and the angle a± that separates it, on a copy of the SLS, 
from a dodecahedral face centre. Using the spherical co- 
sine formula, the lower triangle in the two figures has the 
relation 

cos = cos cos — h sm sm — cos a± (3) 

Re Re o Re 5 

where a+ and a_ are for Figs [T] and [2] respectively. 

Clearly, a+ is maximised when x is maximised. For fixed 
r2 and rgLS, x is maximised when x = rshs + ''2, i-e. when 
the upper triangle degenerates into a single line segment. 
Hence, the angle a+ is maximised when x = r^hs + ''2- 

Similarly, a_ is minimised when x is minimised, pro- 
vided that r2 < ^sLS, which is the situation most interest- 
ing for the SLS-SLS cross-correlation method, since close 
pairs are the most useful. For fixed r2 and rgLSi x is min- 
imised when X = rshS ~ '"2, i-e. when the upper triangle 
degenerates into a single line segment. Hence, the angle a_ 
is minimised when x = rshS ~ ^2 • 

Hence, by symmetry, for a given separation r2 < ''slS: 
the minimum and maximum boundaries for defining an an- 
nulus 



a_ < a < «-(- 



(4) 



on the SLS around a dodecahedron face centre, in order 
to include all points that can potentially be members of 
a close pair separated by a spatial geodesic distance of at 
most r2 for the corresponding pair of matched faces of the 
fundamental domain, are 



a± 



cos 



rsLS±''2 

Re 



cos ^^^^ cos f 

He 5 



sin ^^^ sin ^ 



(5) 



When r2 is larger than the separation of dodecahedral face 
centres (Eq. (32) in RBSG08), i.e. when 



r2 > 2Re 



frsLS 
\Rc 



10 



(6) 



the derivation leading to the expression for a_ in Eq. [5] is 
no longer valid. Instead, the lower limit a_ should be set 
to zero. 

As mentioned above, the preselection enabled by Eq. ([5]) 
can be applied in two ways: 

(i) a point that fails to be a member of a "close" pair for a 
given maximum separation r2 in all of the 12 directions 
of holonomy transformations to adjacent copies of the 
point in the covering space can be removed from the list 
of uniformly selected points; and 

(ii) when iterating through pairs of points and holonomy 
transformations gi (i = 1, . . . , 12), a pair of points for 
which at least one of the two points does not satisfy the 
condition a < a+ can be rejected without calculating 
the spatial separation of the pair. 
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When r2 is large, a+ will also be large, and the six pairs of 
annuli may be sufficiently wide that together they cover the 
whole sky. In this case, effect (i) will not occur. For a small 
enough separation r2, the effect should occur. In that case, 
more points can be uniformly selected from S"^ according to 
a uniform distribution, and again tested, until the required 
number of points is obtained. In that case, a higher fraction 
(though not 100%) of pairs defined by this set of points will 
be useful for the cross-correlation function calculation. This 
should increase the number of pairs per bin, especially for 
the smallest bins, which have the fewest numbers of pairs. 
This would be useful for high resolution calculations. Both 
effects (i) and (ii) should increase the calculation speed for 
a given number of points A'p, since they avoid having to 
carry out unnecessary matrix multiplications. 



3.2. Use of preselection on the five-year WMAP data 

The MCMC analysis is performed as in RBSG08, using the 
ILC and TOH five-year WMAP data (Sect. ^, but in- 
cluding the smallest separation bin, i.e. using the full range 
of separations 



Table 1. Example of benchmarking on a 3GHz processor.' 



d/rsLS < 40/90, 



I.e. 



d< 4.4/1-1 Qp^ 



(7) 



(8) 



for fltot = 1.01, matter density fim = 0.28 and SLS red- 
shift zsLS — 1100, corresponding approximately to angles 
on the SLS ^9d^ 25°. All other parameters are kept as in 
Sect. 3.6 of RBSG08. In particular, this includes the five pa- 
rameters for orientation of the fundamental dodecahedron 
(galactic longitude and latitude of one face centre (Z, b) and 
a rotation parameter 9 around the axis defined by (/,&)), 
the matched circle size a and the "generalised" twist phase 
when matching opposite faces. The GPL (GNU General 
Public Licence) program CIRCLE^J is used. 

3.3. Analysis of simulations 

Although the method presented in Sect. 13.11 should speed 
up the calculation, analysing a large number of simulations 
is still time-consuming. Here, we estimate S^ for 50 sim- 
ulations and the WMAP observational map described in 
Sect. 12. 21 and select those 20 simulations whose S^^ estimates 
are smallest, provided that S'^ is not less than S'^ of the ob- 
servational map. In each estimate, 10,000 points selected 
randomly from a uniform distribution on the sky outside 
of the kp2 mask were used. Using each of these 20 simula- 
tions and the observational map, four MCMC chains with 
random starting points in the parameter space described in 
Eq. (28) of RBSG08 are carried out. 



^ Version CIRCLES-0.3.2.1 was used for calculating the 
MCMC chains for the five-year WMAP maps, and ver- 
sion ClRCLES-0.3.8 was used for calculating the chains 
for the INC3 observational and simulational maps. 
Various versions of CIRCLES are downloadable from 
http: //adjani .astro.umk.pl/GPLdownload/dodec/ These 

and earlier versions of the software require medium to advanced 
gnu/Linux, Fortran77 and C experience for a scientific 
user. 
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^ The pseudo-random number generator has the same initial 

seed for each calculation. 

^ An estimate of a+ from Eq. ((5)l is shown in the cases where 

pair preselection as described in Sect. 13. Il is used. 

'^ Calculation time. 

"^ Numbers of pairs in the smallest bin Na,Nc for ^a,Cc 

respectively. 



4. Results 

4.1. Benchmarking 

An example set of calculation times and number of pairs 
in the smallest separation bin are shown in Table [U for 
one calculation of the auto-correlation and cross-correlation 
functions at an arbitrary PDS orientation and twist, for 
a = 5° and a = 60°, which determine the ratio rshs/Rc- 
The speed-up factors range from about 3-10, depending on 
both a and r2. 

For d < r2 — A.Ah^^ Gpc, the "annulus outer radii" 
are a+ = 45.0°, 90.9° for a = 5°, 60° respectively. This 
is clearly too large to allow any removal of points from the 
list of potentially useful pairs, i.e. effect (i) in Sect. l3.1l does 
not occur: Na and iVc are negligibly affected by the pair 
preselection mechanism. However, the labelling of points to 
record which annuli they can potentially be pairs of does 
yield a speed-up through effect (ii), i.e. by factors of about 
7 and 3 for a = 5° and 60° respectively. 

For a much smaller maximum pair separation, i.e. d < 
r2 — 0.4/i^i Gpc, even though the annulus outer radii are 
still quite large, i.e. a+ = 13.6°, 62.9° for a = 5°, 60° re- 
spectively, both effects (i) and (ii) occur. That is, not only 
is there a speed-up by a factor of about 7-10, but there is 
also an increase in the number of pairs in the smallest bin 
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Fig. 3. Full sky map [Lambert azimuthal equal area projection 
(Lambert 1772), centred on the North Galactic Pole (NGP), 
with the 0° meridian as the positive vertical axis and galactic 
longitude increasing clockwise] showing the optimal orientation 
of dodecahedral face centres based on 160,000 steps in 16 MCMC 
chains, using the five-year ILC map (upper panel) and the five- 
year TOH map (lower panel), and the kp2 mask, showing face 
centres for which P > 0.5. 



for calculating the cross-correlation function, by factors of 
about 30 and 7 for a = 5° and 60° respectively. 

In practice, use of a small maximum pair separation bin 
to get cosmologically significant results will be complicated 
by the relatively larger contributions from the Doppler and 
ISW effects, from residual foreground contamination, from 
the differences between various versions of the all-sky CMB 
map at these resolutions (Aurich et al. 2006), and from the 
absence of the large scale signal. Nevertheless, the concern 
expressed in Sect. 5.5 of RSBG08 that improvements to the 
algorithm would be geometrically quite complex, at the risk 
of introducing biases to the method, appears to have been 
overcome, reducing one obstacle to small scale work using 
this type of method. 

In a full MCMC chain, the matched circle size a will 
vary between the limits illustrated in Table [1] The actual 
speed-up factor (and increase in numbers of pairs per bin 
if this occurs) will depend on the particular path of the 
MCMC chain through parameter space. In the calculations 
leading to the results described below, the range 5° < a < 
60° was retained. 



Table 2. Sky positions of the best estimate of the six dodeca- 
hedral face centres for the five-year ILC map with the kp2 mask. 



Jmin 


i" 


n 


I 


b 


^({i.b)) 








o 


O 


O 


0.4 


7 


5141 


182.8 


62.3 


1.1 


0.4 


12 


5239 


305.5 


44.4 


1.2 


0.4 


3 


7097 


45.0 


49.5 


0.7 


0.4 


5 


5812 


115.9 


19.5 


1.2 


0.4 


8 


5056 


174.6 


-3.0 


2.9 


0.4 


10 


4707 


239.8 


13.9 


2.1 


0.5'^ 


7 


2736 


181.1 


62.2 


1.4 


0.5 


12 


2838 


305.7 


44.5 


1.3 


0.5 


3 


3712 


45.4 


49.3 


0.7 


0.5 


5 


3005 


114.6 


18.7 


1.6 


0.5 


8 


2732 


178.1 


-0.5 


1.6 


0.5 


10 


2570 


239.9 


13.8 


1.6 


0.6 


7 


1429 


179.3 


62.6 


1.1 


0.6 


12 


1487 


306.0 


44.3 


1.2 


0.6 


3 


1919 


45.5 


48.8 


0.8 


0.6 


5 


1544 


113.6 


18.4 


2.0 


0.6 


8 


1387 


175.6 


-1.6 


3.0 


0.6 


10 


1419 


237.7 


14.5 


1.6 



'^ The face centres are ordered according to the ordering in 
Table 1 of RBSG08. The other 6 faces are directly opposite 
with identical errors. 

The estimate for Pmin = 0.5 corresponds to the points shown 
in the upper panel of Fig. E] based on 160,000 steps of MCMC 
chains. 



Table 3. Estimates of matched circle radius a and twist phase 
(j) from the Integrated Linear Combination (ILC) and Tegmark 
et al. (2003) (TOH) versions of the five-year WMAP data. 



map 


-tmin 


n'' 


a 

o 


O 




o 


o 


ILC 
ILC 
ILC 


0.4 
0.5 
0.6 


5508.67 
2932.17 
1530.83 


20.3 

20.5 
20.1 


0.7 

1 

0.7 


38.7 
37.8 
39.8 


1.6 
1.2 
1.1 


TOH 
TOH 
TOH 


0.4 
0.5 
0.6 


4802.08 
2955.58 
1655.92 


21.2 
20.4 
20.3 


1.4 
0.8 
0.7 


30.6 
32.1 
27.5 


4.8 
2.6 
5.5 



The number of steps n can be a non-integer since for a given 
MCMC step, it is possible that some of the face centres 
fall within the convergence radius of the final iteration as 
described in Sect. 14^ but other face centres do not. 



4.2. Parameter estimates from the five-year WMAP data 

For both maps of the five-year data as described in Sect. 12. II 
(ILC and TOH), iVchain = 16 MCMC chains were run, each 
starting with different random seeds, using the kp2 mask. 
Each run had 12,000 steps and the first 2000 steps of each 
were discarded!! Fig. [3] shows the sky positions (^&)i=i,i2 



The MCMC chains used in this paper 
can be downloaded for independent analy- 
sis from the file |http ://adjani. astro .umk. pi | 
l/GPLdownload/MCMC/mcmcRBGOSTtbzI 
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Fig. 4. Distribution of a and (j) states where P > 0.5 in the 
MCMC chains of the dodecahedral solution used in Table (2] for 
the ILC (upper panel) and TOH (lower panel) maps. In this and 
similar figures, lines indicating ±36° are shown. These are not 
fit to the data. 



implied by the {I, b, 9) triples in the MCMC chains for which 
P > 0.5 [Eq. (25), RBSG08], for the ILC and TOH versions 
of the five-year WMAP observational data. The "probabil- 
ity" function P used for optimisation by the MCMC pro- 
cedure is that defined in Eq. (25) in RBSG08. This is not 
a true probability function. 

Similarly to what was done in RBSG08, the 16 chains 
are grouped together into four groups, each of four chains. 
For a given group, steps 2001 to 12,000 from each of the 
four chains are concatenated. 

We make the convergence requirements a little more 
stringent than was described in Sect. 4.1 of RBSG08. That 
is, we start from an initial angular radius of /3i = 30° (cov- 
ering most of the sphere), decrease by 1° for the next 20 
iterations, and then remain constant at f3j>2i = 10° until 
the iteration for that face number converges or until a total 
of 40 iterations has been reached. The analyses of the four 
concatenated groups of chains give what are considered to 
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Fig. 5. Histogram of the twist angle shown in Fig.|4l with viV 
error bars, for the ILC (upper panel) and TOH (lower panel), 
excluding states with a < 15°. 



be four independent estimates in order to get an estimate 
of the uncertainties due to our MCMC estimation method. 

The resulting numerical estimates are listed in Table [21 
The columns show minimum "probability" Pmin , face num- 
ber i, number n of MCMC steps contributing to the esti- 
mate obtained from the final iteration, galactic longitude 
I and latitude 6, and the standard error in the mean be- 
tween the four estimates of different sets of MCMC chains, 
in great circle degrees, cr^(;^f,))- These values do not differ 
significantly from those in Table 1 and Table 4 (for the kp2 
mask) of RBSG08. 

The MCMC states for a and in the final radii of con- 
vergence, and their means and standard errors in the mean, 
are shown in Fig. [4] and Table [3l Histograms of the distri- 
bution of are shown in Fig. [5] Again, these values do not 
differ significantly from those in Table 2 of RBSG08. The 
TOH map shows a small offset between the best estimate of 
(\) found (for P > 0.5) using our convergence algorithm and 
that at which the histogram of states peaks. The former 
is = 30.4°, i.e. a little below 36°, while the latter (lower 
panel of Fig. [S]) is a few degrees above 36° . The difference 
can reasonably be attributed to the fact that the former 
uses a convergence algorithm in multi-dimensional param- 
eter space, whereas the latter is the peak of a projected 
distribution using a single parameter. 

As was discussed in Sect. 4.2 of RBSG08, when a 
is not too large, small changes in the comoving separa- 
tion between pairs lead to relatively large changes in a. 
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Table 4. S^ estimates with and without the kp2 mask for the 
ILC and TOH five-year maps. 



map kp2 no mask 

ILC 1012 4851 
TOH 1136 2749 



Does the use of the smallest bin in the present analysis, 
and/or the use of the slightly improved quality between 
the three-year and five-year WMAP data help overcome 
the large uncertainty in a? Visual inspection of Fig. 3] sug- 
gests that the MCMC chains on the ILC five-year map 
favour 22° <a<35°, while those on the TOH map favour 
15°^a <,35°. The problem of approximate degeneracy in 
a, which is presumably sensitive to moderate levels of sys- 
tematic error, remains. 



4.3. Simulations 

4.3.1. Estimates of S^ 

The value of S^ [Eq. ([2])] in the INC3 observational map 
(using the kp2 mask) is 



5f ^-^ = 963inKy 



(9) 



The 50 simulations have S^ in the range ll70{fiK)'^ < S^ < 
8645(/xiir)^, i.e. up to about an order of magnitude higher 
5^ than in the observations. As mentioned in Sect. 12. 2[ this 
is because the WMAP cosmological signal has a lot of power 
close to the Galactic Plane and we use the kp2 cut sky. In 
order to use the simulations best able to test the hypothesis 
that a simply connected universe with an observationally 
valid large scale auto-correlation can give a PDS-like signal, 
we select the 20 of these with the lowest S^ values, i.e. in 
the range 



1170{^iKf <S^< 3782{fiKY 



(10) 



i.e. with up to about 3.9 times larger 5"^ than in the obser- 
vations. Since at large scales (outside of the kp2 cut), these 
simulations are more correlated than the observations, we 
should obtain an upper limit to the the frequency of detect- 
ing PDS-like signals, assuming that the PDS hypothesis is 
wrong. 

As a check on the amount of large scale power present 
in the Galactic Plane in different versions of the map of cos- 
mological signal, we estimate S^ for the ILC and TOH five- 
year maps with and without the kp2 mask. Table 0] shows 
these estimates. Both maps have much more power with- 
out the cut than with the cut, as expected. Moreover, while 
the difference in estimates of S'j for the two maps differs by 
nearly a factor of two for the full sky, it differs by only 10% 
for the cut sky. This confirms the advantage in analysing 
the cut sky rather than the full sky: there is approximate 
consensus between these two different methods of generat- 
ing the map. The INC3 estimate of S^ = 963 (/iX)"' is also 
close to these two estimates. 
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Fig. 6. Full sky map showing the optimal orientation of dodec- 
ahedral face centres based on 40,000 steps in 4 MCMC chains, 
using the three-year INC map and the kp2 mask, showing face 
centres for which P > 0.5. 

Table 5. Estimates of matched circle radius a and twist phase 
</> for the INC3 observational map. 



pa 

^ mm 


n^ 


a 


<!> 






o 


o 


0.3 


294.3 


32.7 


27.6 


0.4 


197.2 


30.8 


31.2 


0.5 


108.8 


30.8 


32.5 



^ Minimum probability. 

^ Number of MCMC steps contributing to the estimate. 



4.3.2. MCMC chains 

While the speed improvement introduced in this paper 
makes it possible to analyse a set of simulations rather 
than just an observational map, carrying out large numbers 
of MCMC chains still remains computationally prohibitive. 
For this reason, we carry out only a small number of MCMC 
chains (four) on each data set (INC3 observational data set 
or simulation), and do not attempt to estimate uncertain- 
ties on the individual optimal parameters for a given data 
set. 

We first concatenate together steps 2001 to 12,000 of 
each chain in a group, i.e. we ignore 2000 burn-in steps. This 
concatenated chain is considered to be a single chain for 
the iterative procedure of estimating parameter values. We 
modify the method of choosing an initial rough estimate of 
the optimal dodecahedral face centre positions that is used 
to start the iterations towards a more precise estimate by 
randomly selecting a set of dodecahedral face centres [{I, b) 
chosen from a uniform distribution on S*^, chosen from 
a uniform distribution on (0, 27r/5).] This risks causing the 
convergence to be less accurate, but since this is applied in 
the same way for both simulations and observations, this 
should not introduce any statistical bias for comparison of 
observations to simulations. 

We also make our iteration a little more stringent than 
that described in Sect. 14.21 In the first iteration of param- 
eter estimation, we estimate the dodecahedral face centres 
starting from a random initial set as just described, and in 
the following iterations, we converge on both dodecahedral 



Roukema, Bulinski & Gaudin: Poincare dodecahedral space parameters 



--0.5 




-1.0 



-1.0 



-0.5 0.0 



0.5 



1.0 



Fig. 7. Full sky map showing the optimal orientation of dodec- 
ahedral face centres based on 40,000 steps in 4 MCMC chains, 
using the two simulated three-year INC maps with lowest S^ 
estimates (simulations 92 and 90, in the upper and lower panels 
respectively), and the kp2 mask, showing face centres for which 
P > 0.5. 
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Fig. 8. Distribution of a and (j) states where P > 0.5 in the 
MCMC chains for the INC3 observational map. 

Table 6. Properties of the distribution of best estimates of (f) 
from the 20 simulations. 



p . 

J mm 



0.3 32.9 5.9% 
0.4 38.5 13.1% 
0.5 38.4 32.8% 



^ Root mean square width of the distribution. 
^ Two-sided Kolmogorov-Smirnov probability that the sim- 
ulational values are consistent with a Gaussian distribution 
centred on zero, of width a^ = (jir.m.s.- 



faces centres and a, cf) simultaneously. The values oi (3j (see 
Sect. I4.2p are unchanged. The set of points within the con- 
vergence radius Pj>2i = 10° is used for the final estimates 
of a and <b. 



case, there is clearly a problem in converging on a single set 
of dodecahedral face centres, while in the other, there does 
appear to be more or less convergence to a single solution. 



4.3.3. Optimal dodecahedron orientation: {l,b,9) space 

Figure [6] shows the optimal set of face centres resulting 
from the four MCMC chains for the INC3 observational 
map. While the sharpness of the optimal signal does not 
appear as strongly as in the results using larger numbers 
of chains in RBSG08 and in Fig. [3] for the five-year data 
here, it is clearly consistent in position. Since we use many 
fewer chains here, it is unsurprising that the signal appears 
weaker. This should not be a problem form the present 
purpose, since the uncertainties from using a small number 
of chains should be statistically equivalent for both INC3 
observational data and simulations. 

On the other hand, the relative weakness of the signal 
and the smaller numbers of chains require using slightly 
lower minimum probability thresholds for estimating the 
best solution. Below, we cite results for Pmin = 0.3,0.4,0.5 
rather than the earlier thresholds of Pmin = 0.4, 0.5, 0.6. 

Examples of optimal dodecahedral orientations, for the 
two simulations whose 5^ estimates are lowest (i.e. are clos- 
est to that of the observations), are shown in Fig. [T] In one 



4.3.4. Circle size a and twist phase <j) 

Figure [8] shows that convergence in a and occurs for the 
INC3 observational map in a similar way to that of the 
five-year observational maps. In contrast. Fig. [5] shows that 
the two simulations whose S^ estimates best match that of 
the data, i.e. those whose best dodecahedral face centres 
are shown in Fig. [71 both "escape" towards the lower limit 
a = 5° . The median circle size is a = 5° in both cases. 
This effect can be expected due to the increased relative 
Poisson error when comparing fewer numbers of pairs of 
pixels. Aurich (2008) found a similar effect, describing it 
as "drifting" towards "large L" , which corresponds here to 
small a. Several previous authors have found that indepen- 
dently of whether or not a cosmic topology signal is present, 
relatively higher cross-correlations for zero separation pairs, 
i.e. for pairs on exactly matched circles, have been found to 
occur as the circle size a approaches zero. For example, see 
Fig. 2 in Cornish et al. (2004), Figs 4-6 in Roukema et al. 
(2004), or Fig. 3 in Lew & Roukema (2008), where various 
definitions of a normalised correlation statistic S are shown 
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Fig. 9. Distribution of a and (j) states where P > 0.5 in the 
MCMC chains for the two simulated three-year INC maps with 
lowest 5*5 estimates (simulations 92 and 90, in the upper and 
lower panels respectively). 



to increase as a decreases towards zero. Hence, it can be ex- 
pected that MCMC chains will be drawn towards the lower 
a limit. 

In cases where this occurs, chains spend a large amount 
of time at this limit, but cannot go below it. For this rea- 
son, the median of the a estimates (above a minimum 
"probability" used in the MCMC chains, e.g. P > Pmin = 
0.5), should more accurately represent the optimal region 
favoured by the MCMC chains than the mean. Thus, here 
we use the median of a. 

Figure [10] shows that nearly all the simulations escape 
to the lower a limit. Most of the simulations have best 
estimate a values less than 10°, and two have a estimates 
just a few degrees higher. Only two of the simulations have 
an optimum a estimate anywhere near the circle size a 
of the INC3 observational best estimate given in Table [5l 



„INC3 



30.8° (for P„ 



0.5). 



Do the latter two simulations, which are similar to the 
observational map in the sense that an optimal solution 
away from low a limit is found, have sufficiently similar 
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Fig. 10. Median circle sizes a (see Sect. l4.3T4|) and twist angles 
(j} (mean) for each of 20 simulations (empty circles) and the 
WMAP observational map (solid circle, values given in Table (SJ, 
analysed using the steps with P > 0.5 [see Eqs (25), (26) of 
RBSG08]. In each case, the 10,000 steps following 2000 burn-in 
steps of each of 4 MCMC chains started at random points in 
parameter space are concatenated for this analysis. 
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Fig. 11. Median circle sizes a as a function of S^ for each of 
the 20 simulations (empty circles) and the WMAP observational 
map (solid circle), analysed using the steps with P > 0.5. 



characteristics to the observational map such that the ob- 
servational map can be considered a chance realisation sta- 
tistically similar to these two simulations? Figure [11] does 
not support this. The two simulations are among the sim- 
ulations that have the highest S^ estimates, well above 
three times that present in the observational map. This is 
consistent with what is expected: the stronger the auto- 
correlation on large scales, the higher the chance is of 
getting high cross-correlations between apparently distant 
parts of the sky over a large number of pixel pairs, rather 
than escaping to the low a limit where there are relatively 
few pixel pairs. This can be quantified as follows. 
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Fig. 12. Full sky map showing the optimal orientation of 
dodecahedral face centres based on 40,000 steps in 4 MCMC 
chains, using the two simulated three-year INC maps with the 
highest a estimates (simulations 58 and 80, in the upper and 
lower panels respectively), and the kp2 mask, showing face cen- 
tres for which P > 0.5. 



Since the a distributions are constrained from below, 
they are unlikely to be Gaussian. So, estimating the sig- 
nificance of the correlation between a and S^ among the 
simulations is best done using a non-parametric statistic. 
Spearman's rank correlation p and Kendall's rank corre- 
lation T one-sided tests with a positive correlation as the 
alternative hypothesis give probabilities that a and S^ are 
unrelated of 2.5% and 2.4% respectively. Even if we ar- 
bitrarily remove the two simulations with high a from the 
data set, the same two two rank correlation tests give prob- 
abilities that a and S(^ are unrelated of 12.6% and 12.5% 
respectively. 

This supports the visual impression from Fig. 1111 For 
low Sj, an optimal cross-correlation in GRF simulations 
should escape to the low a limit. The point representing the 
observational simulation appears to be quite exceptional. 

Do the two simulations with high a estimates (num- 
bered 58 and 80, with a estimates of 24.5° and 31.9° re- 
spectively) have convergent estimates of dodecahedral face 
centres and 0? Figure [T^ indicates a poor convergence of 
dodecahedral face centres for simulation 58 and what looks 
like the superimposition of a strong primary and a weak sec- 
ondary convergence in the case of simulation 80. Moreover, 
Fig. [THl shows that both of these two simulations have quite 
strongly bimodal distributions in cj) rather than favouring 
any individual optimal value of the twist 0. This is quite 
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Fig. 13. Distribution of a and states where P > 0.5 in the 
MCMC chains for the two simulated three-year INC maps with 
highest a estimates (simulations 58 and 80, in the upper and 
lower panels respectively). 
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Fig. 14. Histogram of the optimal twist angle (shown in 
Fig. nop , with viV error bars, together with a Gaussian distri- 
bution of width 38.4°, centred at zero (see Table |6]). 
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different behaviour to that in Fig. [5] for the observational 
map. 

However, in order to be conservative, let us suppose that 
simulations 58 and 80 converge well enough in comparison 
to the observational map that we can consider them to have 
convergent MCMC solutions with a > a^"^^^ = 5°. This 
gives us an estimate 



P{a » a"""* I S(^ < S.gS-P^^) < iQ%^ 



(11) 



where a ^ a'""'* represents the event of getting a non- 
Poisson-noise signal at least 10° (the MCMC step size) 
away from the lower limit of a'""'*. 

4.3.5. Distribution of the optimal twist phase (/) 

It is clear in Fig. [10] that the simulations give a distri- 
bution of best estimate twist angles ^ different from the 
uniform distribution on [— 7r,7r] described in Eqs (12) and 
(13) of RBSG08 and discussed in Sect. 5.4 of that paper. 
Figure [TJ] shows the distribution as a histogram. A two- 
sided Kolomogorov-Smirnov test between the distribution 
of the best estimates of from the 20 simulations and a uni- 
form distribution on [— tt, tt] rejects equality with P — 0.01. 
The values of (j) for the simulations, shown in Figs 1101 and 
\\M all lie in the range (—100°, +100°), and mostly seem to 
cluster even closer to = 0. 

A likely explanation is that this is a consequence of 
the anti-correlation in the auto-correlation function mea- 
sured in WMAP sky maps at nearly antipodal scales by 
different authors using different methods (Fig. 16, Spergel 
et al. 2003; Fig. 1, RBSG08). The Hinshaw et al. (2007) 
Ci values should implicitly include the information that 
there is an antipodal anti-correlation. Estimates of the 
auto-correlations of the simulations confirm that an anti- 
correlation of about — lOO(^K)^ at (/> = ±7r is present, so 
that the simulations correctly reproduce this characteris- 
tic of the observational data. This anti-correlation implies 
that MCMC chains should disfavour (j) — ±7r when corre- 
lating pairs on a matched circle pair, and, hence, generally 
disfavour (j) = ±7r for pairs on "matched annuli" . This is 
discussed further in Sect. 15.3."^ 

Since a uniform distribution on [— tt, tt] is clearly wrong, 
a reasonable hypothesis must be made regarding the in- 
trinsic, expected distribution of best estimates of (j). Given 
the numerical results from the simulational analyses and 
the presence of the cf) — ±7r anticorrelation, which should 
favour (j) away from ±7r and towards zero, we assume a 
Gaussian distribution, /(0), centred on = with width 
CT0 estimated by the r.m.s. of cj) in the simulations. 

The parameters of this distribution for different Pmin 
values, and the Kolmogorov-Smirnov probabilities that the 
simulational values are consistent with a Gaussian distri- 
bution, are given in Table [G] For the lower thresholds, 
Pmin = 0.3 and Pmin = 0.4, the Gaussian distribution 
hypothesis is mildly rejected by the Kolmogorov-Smirnov 
test. If we set the mean and standard deviation of the 
Gaussian distribution to be the mean and standard de- 
viation of the simulational <f> estimates, then the three 
Kolmogorov-Smirnov probabilities for Pmin = 0.3, 0.4, 0.5 
are Pks = 54%, 84%, 97% respectively. Thus, it is clear 
that the distributions are consistent with Gaussianity if we 
use the estimated means rather than force symmetry about 
= 0. 



Could there be a reasonable justification for using a non- 
zero mean 0? The only possible source of systematic asym- 
metry is that the noise simulations follow noise patterns 
in the observational data, which are not perfectly symmet- 
rical. It is not obvious that this asymmetry is sufficient to 
justify assuming an asymmetry in the expected distribution 
of 0. Moreover, the means and standard errors in the mean 
for the three Pmin thresholds are 12.8 ± 7.1°, 14.4 ± 8.4°, 
12.1 ± 8.6° respectively, showing no statistically significant 
difference from zero. 

Given a Gaussian distribution in centred on zero with 
width Ufj, = 0r.m.s. ^s listed in Table [SI the probability that 
is as close to ±7r/5 as the observational value (Table [5]) 
is close to +7r/5 is 22%, 12.8%, 9.4% for Pmin = 0.3, 0.4, 0.5 
respectively, i.e. 



P(min(|0±7r/5|)< 10^^^^ 
< 22%. 



^/5| 



S^ < 3.95*^ 



INC3 



(12) 



Since the values 0r.m.s. are themselves not too far from 7r/5, 
it is clearly not so improbable that ~ ±7r/5, compared 
to what would be expected from a uniform distribution on 
[—TT, tt]. In other words, this suggests that using the present 
method, an estimate of is not as good a discriminator be- 
tween a chance PDS-like signal and an intrinsic, physical 
signal as it would be if the expected distribution were uni- 
form on [— 7r,7r], i.e. there is another topological degeneracy 
(cf Aurich et al. 2005a) in CMB all-sky maps. 

On the other hand, is it just a coincidence that 0r.m.s. '^ 
7r/5? In Sect. l5.3.2l below we discuss this question. It is pos- 
sible that the empirical Ci spherical harmonic spectrum, 
even with randomised phases, may encode more cosmic 
topology information than might naively be expected. 

4.3.6. Probability of rejecting the simply connected, infinite, 
flat model 

The analyses of these simulations indicate that the require- 
ment for the MCMC chains to avoid "escaping" to the lower 
limit in circle size a gives a stronger constraint against the 
simply connected, infinite, fiat model than the requirement 
that could potentially exclude the PDS model, i.e. the re- 
quirement that « ±7r/5. 

Let us write the PDS-like characteristics of the WMAP 
observational data which we have tried to reproduce by 
simulations as follows. The data 

(i) have a large scale cutoff in structure statistics 
(ii) yield a solution with a ^ Q,iimit ^^^^ ^ ^ -|-7r/5 
when using the MCMC method for optimising the cross- 
correlation ^c for the "generalised" PDS. 

Rewrite these as 



(i) S-^^S-fca 

(ii) ^c yields a > a"""* and min(|0 ± 7r/5|) < |0- 
7r/5| < 8.4°, 



INC3 



where we use the "worst" estimate of from Table [51 i.e. 
— 27.6°, for Pmin = 0.3. For convenience, we write these 
even more compactly as 

(i) 5c<5fC3 

(ii) fWMAP^ 
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Since both S^ « S^"^^ and ^wmap ^^.g p^sent in the 
WMAP data, the probability of these two characteristics 
both occurring in the simply connected, infinite, flat model 
can be written P[{S^ w S'|nc;3) ^^wmapj p^.^^-^ ^^^ results 

above, in particular from Eqs (|lip and (J12p . we can write 



Pii 



WMAP I 



5e<S'f^-^) = p 



a ^ a 



limit 



and 



min(|(/.±7r/5|)< \^'^^^'^ - n/5\ 



5^<5™^^ 



< 10%. 



S^<S^ 



INC3\ 



54<3.95fC3), 

(13) 

Are the probabilities that a ^ a'""'* and (j> « ±7r/5 inde- 
pendent of one another, so that instead of the Eq. (|13p . we 
can write 



P 



a ^ a 



limit 



and 



min(|0 ± 7r/5|) < 



/INC3 



7r/5| 



Si<S, 



INC3 



= P(a >«'""'* 5^<S'^ 



:INC3\ ^ 



P 



min(|(?!)±7r/5|) < |^- 



INC3 



7r/5| 



c < cINC3 



2.2%, 



(14) 



again using Eqs (fTTj) and p2|) ? Spearman's rank correlation 
p and Kendall's rank correlation r one-sided tests where 
min(|^ ± 7r/5|) decreases as a increases as the alternative 
hypothesis both give probabilities that these two param- 
eters are unrelated of 7.9%. A two-sided test gives 16% 
in both cases. While neither of these rejections of the hy- 
pothesis that the two parameters are unrelated is highly 
significant, they are strong enough rejections that it would 
premature to assume that the two parameters are indepen- 
dent. Hence, the probability estimate in Eq. ([M]) cannot 
(yet) be assumed to be valid. 

As mentioned above, Spergel et al. (2003) estimated 
that P{S^<Sf^^^^) - 0.15% for an infinite, flat, 
cosmic concordance model, with a fixed spectral in- 
dex of density perturbations. Efstathiou (2004) estimated 
P{S^ < 5WMAP) ^ 3-12.5%. The latter calculation recon- 
structs unobserved structure hidden behind the galactic 
mask by assuming a simply connected model. For the pur- 
poses of testing the simply connected model hypothesis, 
this is internally consistent. 

Using these two estimates to write two respective esti- 
mates of P{S^ < 4NC3) ^ p^g^ < ^.WMAP)^ ^e j^j^^g 



INC3\ 



WMAPi 



P[(5^<5^™-^)nCc 

P{S^ < 5f ^') P(e^^^P|5^ < 5f C3) (15) 

by the definition of conditional probability. Hence, from 
Eq. ([n]), we have 



(16) 



P[(S'e<S'fC3)ne^^^^] < 0.015%, 
P[{S^<Sf''^)n(^^^''] < 1.25% 

for the lower and high estimates of P{S^ ^ SV"'-^^) respec 
tively. 



In other words, the simultaneous existence of both of 
these two properties of the WMAP data, one a generic 
characteristic of small universe models and the other highly 
specific to the PDS, is unlikely with a probability of about 
99.99% or 99% depending on whether the Spergel et al. 
(2003) or Efstathiou (2004) estimates of P{S^ < S^wmap) 
are used. This result only requires a frequentist approach 
to probabilities. Bayesian modelling of prior beliefs is not 
invoked. 



5. Discussion 

5.1. Matched annuli and calculational speedup 

The preselection method described in Sect. 13.11 leads to 
faster calculation times by a factor of about 3-10 fSect. B?T|) . 
This is by eliminating most calculations of pair separations 
for pairs which are not useful for the cross-correlation calcu- 
lation. Is it possible to improve this algorithm even further? 
Given that we have Eq. (O, the number of calculations re- 
quired for a given total number of pairs could, in principle, 
be reduced by another small factor as follows. 

Randomly select a point pi from a uniform distribution 
on S^, considered to be the left-hand copy of the SLS in 
either Fig.[T]or Fig. [51 For each of the 12 holonomy transfor- 
mations Qj to adjacent copies of the fundamental domain, 
check if the angle aij on the SLS from pi to the dodeca- 
hedral face centre for gj satisfies a_ < a^ < a+. If this 
constraint is satisfied, then choose a second point p^ ran- 
domly from a uniform distribution on the circle defined by 
the intersection of the right-hand copy of the SLS in Fig. [T] 
or Fig. [2] and the 2-sphere centred at pi, having radius r2. 

By construction, all of the pairs {pi,Pi) selected in this 
way are at separation r2 , and should be statistically equiv- 
alent to generating a full set of pairs of which both mem- 
bers are uniformly selected on S*^ and then selecting those 
whose separations are close to r2- A loop over values of 
r2 will give cross-correlations over the desired range of 
separations. This construction would bring the calculation 
method closer to the identified circles method itself, with 
the difference that instead of correlating points lying pre- 
cisely along the circles, points "near" the circles are corre- 
lated. 

This raises the question of differences between using 
the present method and using the identified circles method 
with "thickened" circles. The most likely differences be- 
tween these two methods would depend on how "thicken- 
ing" is defined and how overlapping thickened circles are 
dealt with. 

"Thickening" would require a somewhat arbitrary 
choice of an averaging procedure. For example, a circle 
could be divided into equal angular intervals and thick- 
ened so that individual bands of the annulus are internally 
averaged to obtain the temperature fluctuation at that an- 
gular position around the circle/ annulus. Alternatively, a 
Gaussian smoothing could be used. However, neither of 
these methods would take into account the fact that the 
changes in the spatial geodesic pair separation of points 
inside and outside the zero thickness circle are not linear 
with change in their angular distance from the centre of a 
circle. In contrast, the present method calculates and uses 
the spatial separations directly. 

The problem of overlapping thickened circles follows 
from the fact that twelve pairs of annuli on the 2-sphere 
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intersect each other many times. The greater the thick- 
ness of the annulus, the greater the number of points in 
these intersections. If a correlation statistic is calculated 
over each circle separately and then averaged, then points 
which are members of these intersections contribute sev- 
eral times (typically two or three) more to the final statis- 
tic than points which are members of only one annulus. 
If chance fluctuations at some pairs of these intersections 
happen to have high correlations or anti-correlations, then 
the final statistic might be biased by these pairs, since it 
would implicitly assume that the pairs are independent of 
one another, even though this is false in some cases. 

The use of numerical simulations should make this more 
of a problem of excessive noise rather than systematic bias. 
It is difficult to see any simple way in which a method di- 
rectly based on the identified circles principle could avoid 
this problem, whether or not its final effect is statistical 
bias or rather an extra source of noise. For the SLS opti- 
mal cross-correlation method, this problem does not exist, 
since both points are selected uniformly on S*^ and selected 
afterwards based on their spatial separations (the method 
as presented in RBSG08), or else preselected in a way that 
is equivalent to this (the present paper). 

Hence, results from comparing a series of thickened cir- 
cle pairs for a range of thicknesses to use of the SLS optimal 
cross-correlation method should be expected to be different, 
for both geometrical and statistical reasons. 

5.2. Five-year WMAP data 

The five-year WMAP data (Sect. K2\i give best estimates 
of the PDS model similar to those for the three-year data. 
Since most of the improvement in the WMAP data is at 
small angular scales, this is unsurprising. 



5.3. Simulations 

Given the infinite, flat model with GRF as a null hypothe- 
sis, the probability that the observed WMAP data could 
be a random realisation of this model, i.e. that both a 
large-scale cutoff in power and a specifically PDS-like sig- 
nal appear in the WMAP data, is estimated in Eq. P^ as 
0.015% or 1.25% depending respectively on whether we use 
the Spergel et al. (2003) or Efstathiou (2004) estimates of 
the former. This appears to be a significant rejection of the 
infinite flat model. Is there any way to avoid this interpre- 
tation? Other properties of the cosmological component of 
the WMAP data, unlikely in the infinite fiat model, have 
been noted by many authors. It is unlikely that these dif- 
ferent properties are statistically fully independent of one 
another. 

On the other hand, in this case we have a physically 
motivated model, motivated from the most fundamental 
level: the spatial section of the Universe is a 3-manifold, 
i.e. it must have a "shape". As Schwarzschild (1900, 1998) 
stated a little over a century ago, that shape may well be 
multiply connected rather than simply connected. 

More recently, the generic predictions of a cutoff in 
power at large scales were made when only the COBE data 
were available (Starobinsky 1993; Stevens et al. 1993). The 
identified circles principle (Cornish et al. 1996, 1998), of 
which the present method is an extension, and the matching 
of the Poincare Dodecahedral Space hypothesis to the large 





Fig. 15. Schematic diagram showing two circles on the SLS on 
which cross-correlation is to be optimised, for twist (j> — 0. The 
fluctuations are zero everywhere on the two circles except at the 
points A and B on the first circle and C on the second circle. At 
all three points the fluctuation amplitude is -1-1. 



scale lack of power and the estimates of iltot ~ 1.01-1.02 
(Luminet et al. 2003), were published without the knowl- 
edge that the present version of the method, allowing for 
a "generalised" PDS of arbitrary twist angle, would be ap- 
plied to the WMAP data. It seems physically unreasonable, 
therefore, not to combine the probabilities of the different 
signs of cosmic topology. 

In this case, to argue that the probability in Eq. [16] 
has underestimated, either the Spergel et al. (2003) or 
Efstathiou (2004) probability estimates of a low S^ have to 
be increased, or the estimate of the probability of a PDS- 
like signal, given a low S^, (Eq. [T5)) has to be increased. 
We have conservatively taken the maximum probability es- 
timate in Table 5 of Efstathiou (2004) for the former. 

For the latter, we have conservatively assumed that 
the two simulations with a 3> a'""'* have a signal given 
by the MCMC method similar enough to the signal in 
the observational map that we can set this probability at 
p(^WMAP|5.^<5iNC3) < ;lo%. To increase this probabil- 
ity significantly, it would be necessary to argue that many 
or most of the simulations have convergence characteris- 
tics, best estimates of a » Q,iimit g^j-^^ ^^^^ estimates of 
(j) « ±7r/5 similar to those for the observational map. Given 
the results presented in Sect. 14.31 a systematic error of this 
sort able to satisfy this seems unlikely. 

On the contrary, Fig. [TT] and the rank correlation statis- 
tics (Sect. 1^3. 4[) favouring a positive correlation between a 
and S^ suggest that P{C^^^^\S^ <Sl^'^^) is sma//er than 
what we have been able to estimate with a small number 
of simulations. This is consistent with what was argued in 
RBSG08, i.e. that the lower the amplitude of the large-scale 
auto-correlations, the less chance there should be of cross- 
correlations occurring in the absence of a PDS-like signal 
or for a wrong orientation of the PDS model. 

This suggests two possible alternative approaches to 
that used here. Either we could generate simulations so 
that m,ost of the set of simulations have 5*^ about as low as 
that observed — with the kp2 cut — or we could generate 
simulations for the full uncut sky. However, both of these 
approaches have problems. The problems in both cases arise 
from the fact that a large amount of the large scale power 
is estimated to lie close to the Galactic Plane, but the es- 
timates of how much this power is and of how precisely we 
can measure the detailed fluctuations close to the Galactic 
Plane are highly uncertain. For example, Table|3]shows that 
the estimates for 5^ for the five-year ILC and TOH maps 
differ by only 10% when the kp2 cut is used, but differ by 
nearly a factor of 2 for the full sky. 
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Fig. 16. Distribution of the twist angle (f) in 16 MCMC chains 
run on the INC3 observational map, for states where P > 0.5 
(the four chains shown in Fig. [8] plus 12 additional MCMC 
chains), with viV error bars, excluding states with a < 15°. 



Suppose that we generate simulations so that most of 
the set of simulations have S^ about as low as that ob- 
served, using the kp2 cut. In this case, we implicitly as- 
sume that both the extrapolations from outside of the kp2 
cut to inside it, as well as the direct estimates for the full 
sky, vastly overestimate the power inside the kp2 cut. For 
testing a multiply connected model hypothesis, this could 
have some validity given that the spherical harmonics are 
not statistically independent from one another. However, 
this is not the null hypothesis that is to be tested with 
the simulations. The aim is to test the null hypothesis of 
a simply connected, infinite, flat model with Gaussian ran- 
dom fluctuations. This model implies a relation between 
structures inside and outside of the kp2 cut, which needs 
to be included in the simulations if we wish to correctly test 
the hypothesis. In other words, the statistics of fluctuations 
generated by simulations which are designed to mostly have 
small 5^ outside of the kp2 cut are unlikely to be statisti- 
cally equivalent to those of fluctuations generated in the 
way performed here. In the present work, the full set of 
simulations was generated by the Hinshaw et al. (2007) es- 
timates of the Ci values, and a subset of these was selected 
with the criterion that S^ outside of the kp2 cut must be as 
close as possible to S^ of the observations outside of the kp2 
cut. These two different methods of generating simulations 
are distinct. 

The second alternative to the present method would be 
to generate simulations for the full uncut sky, and to run 
MCMC chains on both these and the observational map. 
However, because of the intrinsic difficulty in correcting 
for emission in the Galactic Plane and as is indicated in 
Table |4] for two different versions of the all-sky map of the 
cosmological signal, there would then be a much greater 
systematic uncertainty in the results. Hence, both of these 
two alternative approaches have disadvantages relative to 
the method used here. 

In addition to our main results of probability esti- 
mates, it has become clear that the properties of the ex- 
pected distribution of (f> (for the null hypothesis of an 
infinite, flat model) are not simple. This can be under- 
stood generically by realising that the mathematical proce- 
dure we are using is pattern matching. A cross-correlation 



for the correct mapping between two genuinely correlated 
copies of a single pattern will necessarily yield a high value. 
However, for a pattern which is sufficiently complex, both 
cross-correlations for incorrect mappings between two gen- 
uinely correlated copies of a single pattern, and also cross- 
correlations for arbitrary mappings between two uncorre- 
lated patterns may in some cases yield high values due to 
the complexities of the patterns and chance correlations. 
From first principles, modelling this is unlikely to be simple. 
For this reason, simulations provide an algorithmic short- 
cut to estimating the likely distribution of </>, given a cer- 
tain family of patterns. Gaussian random fluctuations from 
a given Ci spectrum are one such family of patterns. For 
a different Ci spectrum or different statistical properties 
of the fluctuations, different characteristics of the expected 
distribution of may occur. Here we discuss some charac- 
teristics of interest. 



5.3.1. Tendency of an MCMC chain to favour regions of 
approximately symmetric around zero 

Figure 8 in RBSG08 and Figs [H and [5] each indicate that 
in addition to the main cross-correlation signal, there is 
also an additional, weak, secondary signal with an approxi- 
mately equal but opposite value of 0. This is clear in Table 3 
of RSBG08 and Fig.[5]for the five-year ILC and TOH maps. 
Since only four MCMC chains were carried out for each of 
the INC3 real and simulational maps for the main INC3 
analysis, we carried out 12 additional MCMC chains on 
the real INC3 map in order to create a histogram equiv- 
alent to those in Fig. [5l The resulting histogram of the ip 
distribution from the 16 chains is shown in Fig. 1161 This is 
clearly consistent with those in Fig. [5l in the sense that the 
+tt/5 peak strongly dominates over the — 7r/5 peak. 

Is it physically reasonable that a PDS model would give 
both — 7r/5 and +t:/5 as valid twist angles, even where one 
twist gives a much weaker signal than the other? At most 
one of these can indicate the correct 3- manifold of comoving 
space. A possible explanation for both twist angles to be 
favoured in the MCMC chains might be that the density 
perturbations went through some sort of resonance process 
during an early epoch. In that case, it is conceivable that 
a harmonic created at that early epoch would still remain 
present. 

The optimal cross-correlations in the simulations, in 
particular those shown in Figs [5] and [T31 also have bimodal 
distributions of preferred values of (j). Moreover, they ap- 
pear to give nearly equal weight to solutions with positive 
and negative values of 4> of about the same absolute value. 
This suggests that some type of pattern in the fluctuations 
can tend to cause an MCMC chain to favour regions of (p 
approximately symmetric around zero. 

The following schematic diagram shows that at least 
one specific pattern of fiuctuations on the sky can lead to 
some degree of symmetry of optimal values, to the ex- 
tent to which the real (or simulated) pattern mimics the 
idealised pattern. Figure [15] shows two circles on opposite 
sides of the sky, seen by someone external to the SLS, look- 
ing approximately but not nearly along the axis joining the 
two circles. If we approximate this region of the covering 
space S^ as an approximately flat region for intuitive sim- 
plicity, then the mapping corresponding to a "generalised" 
holonomy transformation from one circle to the other is a 
translation followed by a twist. If the twist is either +ip 
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Fig. 17. Schematic diagram of two circles on the SLS for twist 
= 0, illustrating that the observed anti-correlation at antipo- 
dal points on the SLS corresponds to an anti-correlation at twist 
(j} — TV on the circles. Solid circles at points A, B, and C indicate 
positive fluctuations, and hollow circles at the antipodal points 
A', B', and C indicate negative fluctuations. 



or —(j), then the summed cross-correlation is 1. If the twist 
is anything other than ±(/), including zero, then these two 
idealised circles will have a zero cross-correlation. 

The original identified circles principle, i.e. the present 
method in the limit of zero pair separations, would imply 
that both of these patterns are equally optimal. By exten- 
sion, our present method will find the same result for this 
idealised pattern. 

This schematic situation is clearly highly simplified. 
Apart from the fact that the fluctuation patterns are un- 
likely to be as simple as in Fig. [151 here we have assumed 
that the MCMC chain is fixed at a particular orientation 
on the sky and matched circle size. In reality, the chains are 
free to change both of these. Changing orientation and/or 
circle size would weaken the symmetry provided by the pat- 
tern in Fig. [151 Hence, to calculate the relevance of this 
schematic pattern realistically from first principles would 
be quite complex. However, given that we have realistic 
numerical simulations (for the infinite, flat model), we do 
have an illustration here of at least one basic pattern which 
could lead to best solutions which are approximately sym- 
metric in (f> around two values ±0, even though only one of 
these two values is the "true" value (physical or simulated) 
of the map. 

5.3.2. Properties of the distribution of optimal values 

Statistical properties of the distribution of optimal (f> val- 
ues found in the analyses of the simulations (Fig. [T3]) are 
different from what was intuitively expected. Not only is 
the distribution of not uniform on [— vr, tt], but it is mod- 
erately well fit by a Gaussian centred on (p — 0° of width 
0-0 = ^/{(jpj « 7r/5 (Table [6]). Why is the distribution of 
optimal (j) values not uniform on [— 7r,7r]? Why should a^ 
be approximately 7r/5? 

Consider the twist angle = tt. On matched circles, this 
represents pairs of antipodal points, independently of the 
matched circle size a. Figure [T71 illustrates this schemat- 
ically. In general, a pair of points on a pair of would-be 
matching circles are separated by a twist 0^7, where 7 
is the angle on the SLS separating the pair of points. The 
exception is that (j) — n when 7 = tt. This separation cor- 
responds to a spatial geodesic separation between the pair 
of points in the covering space S^ of 2rsLs • 

For antipodal points, ^a is slightly negative (Fig. 16, 
Spergel et al. 2003; Fig. 1, RBSG08). In the simulations 
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Fig. 18. Median circle sizes a and twist angles (j> (mean) for 
each of the 20 simulations, analysed using the steps with P > 
0.5, modified as described in Sect. 15.3.21 by inverting the anti- 
correlation in 50 randomly selected pairs of 10° radius antipodal 
patches on the sky. 
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Fig. 19. Histogram of values shown in Fig. 1181 The Gaussian 
distribution shown in Fig. 1141 is reproduced here. 



analysed here, this anti-correlation at = tt is approxi- 
mately — lOO{fiK)^ in most simulations. 

The MCMC chains are optimised to find positive cross- 
correlations, not anti-correlations. Hence, it should, in fact, 
be expectecO that optimal cross-correlations should cluster 
towards (p — and away from </> = tt. 

We test this with a toy model as follows. The angular 
scale above which the auto-correlation becomes negative in 
Fig. 16 of Spergel et al. (2003) is acos(-0.9) = 154°. To 
first order, we can approximately think of this as antipodal 
pairs of 25° discs on the sky being anti-correlated. So, we 
start with one of the simulated maps and randomly select 
50 antipodal pairs from a uniform distribution on the sky. 
For each antipodal pair, we calculate the mean temperature 
fluctuation within 10° of each pole. We choose this radius 
to be smaller than 25° so that there is a fair chance that 



This fact was missed in RBSG08. 
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the disc we use will mostly cover a single "anti-correlated" 
disc, of full size 25°. If we had chosen the full disc size, 
then frequently there would be too little overlap for an 
anti-correlation to be measured. If the product of the mean 
temperature fluctuations in the two discs of the pair is neg- 
ative, then we multiply the fluctuations in one member of 
the pair by -1. This is done for all 50 antipodal disc pairs. If 
there were no overlap between these discs, then this would 
cover about 80% of the full sky. 

While this procedure is not likely to give a valid sta- 
tistical set of fluctuation patterns for testing the inflnite, 
flat universe hypothesis, it should be sufficient for qual- 
itatively testing the hypothesis that the anti-correlation 
is responsible for the concentration of the expected dis- 
tribution of (j) away from tt and towards zero. Moreover, 
since the anti-correlation at the antipodes provides some 
contribution to 5^, this procedure decreases the mini- 
mum, mean, and median values of S^ for the 20 simula- 
tions from 1170, 2928, 3071(Ai-fs:)'' to 467, 1494, 1169(Ai-ft:)'^ 
respectively, i.e. by about a factor of two. The maximum 
value of S^ increases slightly, from 3782 to 4012(^ii')^. 

Figure [T51 shows the result of searching for optimal PDS 
solutions using four MCMC chains per modifled simulation, 
and analysing the chains as before. The distribution of best 
(p estimates is shown in Fig.[Tni Several effects of the antipo- 
dal anti-correlation reversal are visible in the two figures. 
Firstly, several optimal correlations occur close to the an- 
tipodes, i.e. at ^ ~ TT. Secondly, the chains which escape 
to ck''™'* have optimal values spread over a wider range 
than without the modification. These two effects are qual- 
itatively consistent with the hypothesis that the antipodal 
anti-correlation plays an important role in concentrating 
the expected (j) distribution for an infinite, fiat model to- 
wards zero. 

A third effect can also be seen in Fig. [HI a fairly large 
number of chains no longer escape to «'""'* . Since we reverse 
the fluctuation sign in one element of each of 50 antipodal 
pairs, it is clear that this does not only affect antipodal cor- 
relations, it can also create correlations at several different 
scales, which did not exist in the original simulation. This 
can create highly artifical attractors for the MCMC chains. 

The combination of these effects does not yield a distri- 
bution of <f> which looks uniform by inspection of Figs ll8l and 
1191 A Kolmogorov-Smirnov two-sided test comparing the cj) 
distribution to a uniform distribution on [— tt, tt] yields a 
probability of 16%, i.e. the distribution is only marginally 
consistent with a uniform distribution. Clearly, although 
the non-uniform distribution of cf) is clearly infiuenced by 
the antipodal anti-correlation, the latter alone is insufficient 
to explain it. 

Given that the observational map does include an an- 
tipodal anti-correlation, and possibly other features con- 
tributing to the non-uniform expected distribution of 0, to 
what degree should the optimal cross-correlations favour (j) 
close to zero, and what shape should the distribution of (f) 
take? For example, if a^ as defined above is estimated on 
the interval (without periodicity) [— 7r,7r), then how small 
should it be? 

The simulations are generated from spherical harmon- 
ics using the spherical harmonic mean coefficients Ci of 
Hinshaw et al. (2007), obtained from the WMAP data, but 
with random phases (and simulated noise) . We choose those 
of the simulations with the lowest S'j estimates. These sim- 
ulations contain much of the same information that is in 



the observational data set. After all, this is the point of 
simulations. 

We know that the MCMC chains in the data with 
the correct phases favour (f) — -t-7r/5, and have an anti- 
correlation at 2r'sLS • Randomising the phases of the spher- 
ical harmonics while retaining the C/ values should yield 
many statistical properties that are similar to that of the 
map with the correct phases. So, the simulated maps must 
statistically allow (j) to be at least as high as 7r/5, but do not 
necessarily have to allow it to be much higher. Since the ran- 
dom phases have no way of favouring right-handed twists 
over left-handed twists, a distribution centred at </> = and 
including values roughly up to ±7r/5 is consistent with the 
information that the simulations should statistically con- 
tain. However, the reason why the distribution does not 
extend to optimal twists with \(j)\ ^ 7r/5 remains an open 
question for future work. 



6. Conclusion 

By use of some spherical trigonometry, it is possible to 
speed up the Markov Chain Monte Carlo cross-correlation 
method of testing a cosmic topology hypothesis described 
in RBSG08 by a factor of about 3-10. This is shown in 
Fig. [1] and Eq. ^. This could, in principle, make it prac- 
tical to make calculations at higher resolution than be- 
fore. However, the physical interpretation of the calcula- 
tions would be ambiguous because of several systematic 
effects listed above. 

Moreover, for low matched circle sizes and low maxi- 
mum pair separations, use of Eq. (O can increase the num- 
bers of pairs per separation bin, thereby decreasing the 
noise. A further improvement in the method and the re- 
lation between this method as a test of "matched annuli" 
and tests of matched circles are discussed above in Sect. 15. ll 

Applying the faster method to the ILC and TOH ver- 
sions of the WMAP five-year data, we find little significant 
change in the best estimate parameters for a Poincare do- 
decahedral space model of the Universe, compared to those 
given in RBSG08. Depending on the minimum pseudo- 
probability level used in the MCMC chains that is used 
for estimating the twist (p, the optimal value of (/) is a few 
degrees greater and a few degrees less than -|-36.0° in the 
ILC and TOH maps respectively (Table 121). 

We also applied the faster method to a small num- 
ber of simulated skies. The WMAP observations confirmed 
the generic cosmic topology prediction (Starobinsky 1993; 
Stevens et al. 1993) of a cutoff in structure statistics such as 
the temperature-temperature fluctuation auto-correlation 
on large angular or spatial scales. Here, we estimated the 
weakness of the large scale auto-correlations using a statis- 
tic Ss_ [Eq. dH)] similar to that of Spergel et al. (2003). The 
low observed value of Sj implies that cross-correlations on 
these scales should usually be weak, so that MCMC chains 
as used here should have difficulty finding a region of pa- 
rameter space with high optimal cross-correlations. Hence, 
to conservatively test the infinite, flat, concordance model 
hypothesis, on the assumption that the real observations 
have a low value of S^ due to being a single realisation 
of a random process, we analysed simulated skies which 
use the Hinshaw et al. (2007) C; estimates and randomised 
phases of the spherical harmonics. The observational map 
was calculated from the three least contaminated frequency 
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WMAP bands, Q, V and W, and the Gaussian random fluc- 
tuation simulations were created using an equivalent anal- 
ysis pipeline (Lew & Roukema 2008). 

For simulated and observed skies with the kp2 Galactic 
cut, these simulations using the Hinshaw et al. (2007) Ci 
estimates generally yield overestimates of S^ for the cut sky. 
Since we prefer to analyse the cut sky in order to minimise 
galactic contamination effects, it was necessary to select 
those simulations with the lowest values of S^, so that the 
simulations were statistically as similar as possible to the 
observations outside of the kp2 cut. 

The results of running MCMC chains on the 20 simula- 
tions with the lowest 5"^ estimates from among 50 simula- 
tions were that only two simulations gave optimal cross- 
correlation solutions which did not escape to the lower 
limit in circle size a, where small number statistics favour 
fortuitous cross-correlations. This indicates a conditional 
probability of finding a non-noise solution of about 10% 
[Eq. CID].^ 

The distribution of optimal twists (p from the simu- 
lations showed that despite the low value of S'j, the ex- 
pected distribution of (j) in the simulations was not uni- 
form. Instead, it is consistent with a Gaussian distribution 
centred near (/) = 0, of width ^ 33-38°. Possible reasons 
for the nature of this distribution are discussed above, in 
Sections [OT] and [5X2l 

Assuming this numerical, Gaussian fit to the expected 
(for an infinite, flat model) distribution of (j), and using 
the estimate of (j) obtained from the observational map 
(Table [5]) that is most discrepant from 7r/5, we found that 
(j) could be expected to be as close to ±7r/5 as the obser- 
vational value is close to -f-7r/5 with a probability of about 
22% [Eq. inD]. 

Both of these probabilities are conditional on S^ being 
low, which itself is unusual for an infinite, flat model (e.g., 
Spergel et al. 2003; Efstathiou 2004). Hence, for an infinite, 
flat, cosmic concordance model with Gaussian random fluc- 
tuations, we find that comparing an observational map with 
simulational maps gives an estimate of the chance of find- 
ing both (a) a large scale autocorrelation as weak as that 
observed, and (b) a PDS-like, optimal cross-correlation sig- 
nal similar to that observed to be about 0.015% or 1.25% 
for the Spergel et al. (2003) or Efstathiou (2004) estimates 
of the probability of low S^ respectively [Eq. (|16p ]. 
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